Acoustic features for robust classification of Mandarin tones
Abstract
For applications such as tone modeling and automatic tone recognition, smoothed, all-voiced F0 (pitch) tracks are desirable. Three pitch trackers that have been shown to track pitch accurately are YAAPT, YIN, and PRAAT. In tests on English and Japanese databases, for which ground-truth pitch tracks are available by other means, we show that YAAPT has lower error than YIN and PRAAT. We also experimentally compare the effectiveness of the three trackers for automatic classification of Mandarin tones. Alongside the F0 tracks, a compact set of low-frequency spectral shape trajectories is used as additional features for automatic tone classification. Combining pitch trajectories computed with YAAPT and spectral shape trajectories extracted from an 800 ms interval for each tone yields a tone classification accuracy of nearly 77%, which is higher than the rate human listeners achieve for isolated tonal syllables and higher than that obtained with the other two trackers.
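The abstract does not say which error measure was used to compare the trackers against ground truth. As a rough illustration only, the sketch below computes a gross error rate, a commonly used measure: the fraction of reference-voiced frames where the estimated F0 deviates from the reference by more than a tolerance. The function name `gross_error_rate`, the 20% tolerance, the convention that 0 Hz marks an unvoiced frame, and the sample tracks are all assumptions made for this example, not details taken from the paper.

```python
def gross_error_rate(reference_hz, estimate_hz, tolerance=0.20):
    """Fraction of reference-voiced frames whose F0 estimate is off by
    more than `tolerance` (relative to the reference value).

    Assumption: a value of 0 Hz marks an unvoiced frame.
    """
    if len(reference_hz) != len(estimate_hz):
        raise ValueError("tracks must have the same number of frames")

    # Only frames that the reference marks as voiced are scored.
    voiced = [(r, e) for r, e in zip(reference_hz, estimate_hz) if r > 0]
    if not voiced:
        return 0.0

    # An estimate of 0 Hz on a voiced frame counts as an error as well.
    errors = sum(1 for r, e in voiced if abs(e - r) > tolerance * r)
    return errors / len(voiced)


# Hypothetical five-frame tracks, in Hz (made-up values).
reference = [0.0, 200.0, 210.0, 220.0, 230.0]
tracker_a = [0.0, 202.0, 208.0, 221.0, 229.0]  # close to the reference
tracker_b = [0.0, 100.0, 208.0, 0.0, 229.0]    # one octave error, one dropout

rate_a = gross_error_rate(reference, tracker_a)
rate_b = gross_error_rate(reference, tracker_b)
```

With these made-up tracks, `tracker_a` has no gross errors, while `tracker_b` is penalized for both the halved pitch value and the frame it marks as unvoiced.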
Similar resources
Acoustic analysis of the neutral tone in Mandarin
East Asian languages such as Mandarin have lexical tones in their phonological system. Pronounced in isolation, the fundamental frequency contours produced by these tones are relatively stable, and their shapes are well described phonetically. However, modifications can occur, not only in the well-known case where two consecutive third tones are realized as a tone-two, tone-three sequence, but i...
Local Rhyme-based Acoustic Features for Mandarin Tone Recognition
We investigate the use in Mandarin tone recognition of over two hundred possible local acoustic features based on pitch, overall intensity, and band-passed intensity in the rhyme of a syllable. Features involving pitch height are not as useful as one might expect, showing the need for phrase-level pitch height correction. The intensity contour is useful, particularly when rhyme-initial intensit...
Acoustic realization of Mandarin neutral tone and tone sandhi in infant-directed speech and Lombard speech.
Mandarin lexical tones are modified in both infant-directed speech (IDS) and Lombard speech, resulting in tone hyperarticulation. However, it is unclear if these registers also alter contextual tones (neutral tone and tone sandhi) and if such phonetic modification might affect acquisition of these tones. This study therefore examined how neutral tone and tone sandhi are realized in IDS, and how...
Perceptual assimilation of lexical tone: The roles of language experience and visual information
Using Best's (1995) perceptual assimilation model (PAM), we investigated auditory-visual (AV), auditory-only (AO), and visual-only (VO) perception of Thai tones. Mandarin and Cantonese (tone-language) speakers were asked to categorize Thai tones according to their own native tone categories, and Australian English (non-tone-language) speakers to categorize Thai tones into their native intonatio...
Identifying isolated, multispeaker Mandarin tones from brief acoustic input: a perceptual and acoustic study.
Lexical tone identification relies primarily on the processing of F0. Since F0 range differs across individuals, the interpretation of F0 usually requires reference to specific speakers. This study examined whether multispeaker Mandarin tone stimuli could be identified without cues commonly considered necessary for speaker normalization. The sa syllables, produced by 16 speakers of each gender,...